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Abstract. We describe the extension to multiple datasets of a coherent method for the search 
of continuous gravitational wave signals, based on the computation of 5-vectors. In particular, 
we show how to coherently combine different datasets belonging to the same detector or to 

O , different detectors. In the latter case the coherent combination is the way to have the maximum 

increase in signal-to-noise ratio. If the datasets belong to the same detector the advantage 

C/2 ' comes mainly from the properties of a quantity called coherence which is helpful (in both cases. 



in fact) in rejecting false candidates. The method has been tested searching for simulated signals 
injected in Gaussian noise and the results of the simulations are discussed. 
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r^ ' 1. Introduction 

The search for continuous gravitational wave signals, like those emitted by a deformed rotating 
neutron star, is an important target of the analysis of data produced by interferometric detectors, 
p^l ■ like GEO P , LIGO [2] and Virgo [3] . In the so-called targeted search the source parameters (sky 

position, frequency and frequency derivatives) are assumed to be known. This is the case, .e.g., of 
the search of signals from known pulsars for which radio-astronomical observations can provide 
very accurate measures of the parameters themselves. For this kind of search coherent data 
analysis methods, which precisely correct the signal phase variations due to Doppler, spin-down, 
C^_' Einstein and, if necessary, Shapiro effects, allow in principle to approach the best sensitivity 

at a relatively small computing cost. Various coherent pipelines have been developed and have 
been used (and will be used) to analyze data from Virgo and LIGO scientific runs, see e.g. |1]. 
In particular, we consider here a method which goes through the computation of the so-called 
5-vectors [5J which has been used in the analysis of Virgo VSR2 data for the search of continuous 
gravitational wave signals from the Vela pulsar [4j. 

Given the expected weakness of continuous signals, it is important to use the available data, 
which are typically affected by various kinds of disturbances, in the most effective way. This is 
done in the 5-vectors method by applying different techniques. For instance, a cleaning procedure 
is used both on the initial time domain data [B] and after Doppler, spin-down and Einstein 
effect corrections, in order to remove particularly disturbed periods. Moreover, a Wiener filter 
is applied for taking into account slow non-stationarities so to weight less the more noisy periods 



[3]. In this paper we describe one more analysis step that can be apphed in order to improve 
the sensitivity of the search: if more datasets are available, belonging to the same detectors (e.g. 
two different runs) or to different detectors, they can be properly combined in order to increase 
the output signal-to-noise ratio and to improve rejection of false candidates. Other methods of 
coherently combining data from different detectors have been developed in the past, like e.g. 
the network analysis using the ^-"-statistic [7] and a complex heterodyne method [8|. 

The plan of the paper is the following. In Sec.([2|) the basic concepts of the 5- vectors method 
are summarized and the definition of signal-to-noise ratio is given. In SecdS]) the coherent 
combination of more datasets is introduced and tests done with simulated data are described. 
Sec.(I3D is devoted to discuss further future developments of the method. 

2. The 5-vectors method 

Here we briefly recall the 5-vectors method and define the signal-to- noise ratio (SNR). More 
details can be found in [5] and 0]. Let us indicate the data to be analyzed with 

x'{t) = n'{t) + h'{t) (1) 

where n'{t) is the noise and h'{t) is a continuous gravitational wave signal. The data, after 
the extraction of a small band around the expected frequency of the signal, are corrected for 
Doppler, spin-down and Einstein effects by a re-sampling procedure described in detail in [3] 
and which effectiveness is demonstrated by the very good recovery of parameters of hardware 
injected pulsar signals in Virgo VSR2 data as discussed in [1]. Let us indicate the corrected 
data with x{t) = n{t) + h{t). At this stage a signal present into the data is monochromatic 
apart from an amplitude and phase sidereal modulation. The gravitational wave strain at the 
detector can be then written as 

h{t) = Ho{H+A+{t) + H^A"" (t))e^('^«*+*«) (2) 

(where the operation of taking the real part is understood in order to have the physical strain) 
In this equation the two amplitudes are given by 

cos {2ip) — jrj sin {2iIj) 

sin (2^^) + jr] cos (2^^) . . 

where r] is the ratio between the minor and the major semi-axis of the wave polarization ellipse 
(r/ = for a linearly polarized signal, 77 it 1 for a circularly polarized signal) and ip is the 
polarization angle defining the direction of the ellipse semi-major axis respect to the celestial 
parallel of the source (measured counterclockwise). The functions A'^{t), A^ {t) are equal to the 
standard beam-pattern functions F'^{t; ip), F^ (t; ip), defined e.g. in [9], computed at -0 = 0. The 
power of the signal in Eq.(l2]) is spread among the five angular frequencies a;o, ujq ±0,, wq ± 2r2, 
where fi is the Earth sidereal angular frequency. 

The noise n{t) is assumed to be gaussian with mean /i = and standard deviation a. 
The signal detection and parameter estimation procedure is based on the computation of data 
and signal 5-vectors, defined as their Fourier components at the 5 frequencies in which the 
signal is split by the sidereal modulation. More precisely, given a generic time series /(t), the 
corresponding 5-vector is a complex vector 

/(t)e-^'('^o-''^)*dt (4) 

T 



where k = [—2, —1, ..., 2] and T is the observation time. The signal 5-vector is defined as 

A = {H+A+ + Hy.A'') (5) 

Note that for a source with known position and frequency and for a given detector A+ and A^ 
can be computed from Eq.@ while Hq, tj, ^, $o fire in general unknown (for a few pulsars, 
like Vela and Crab, astronomical observations provide estimations of r] and ip [10] )• The data 
5-vector can be expressed as 

X = //oe^'*" {H+A+ + i/x A^) + N = Hoe^'^'^A + N (6) 

where N is the noise 5-vector. It is important to stress that data or signal 5-vectors contains 
exactly the same information of the corresponding time series. Once the 5-vectors of data, X, 
and signal. A"*" and A^, have been computed the following complex amplitude estimators are 
built: 
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X • A^ 
^x = ^ (T) 

where the scalar product among two complex vectors a and b, each with m components, is 
defined as a • b = J^T^i^^i^t- H^ and Hx are the estimation of, respectively, Hqc^^^H^, and 
Hqc^^^Hx and are used to compute the detection statistics 
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which is compared to its expected distribution in presence of noise alone to establish if a 
statistically significant candidate is present. In case of detection Eq.Q is also used to estimate 
source parameters. In particular, the estimation of the signal amplitude is given by 



/70 = Vl^+I' + I^x|2 (9) 

A preliminary comparison of our detection statistic with the J^-statistic is done in . A more 

detailed comparison, also with other detection statistics like the i3-statistic introduced in [llj, 

will be done in a forthcoming paper. 

If a detection is claimed we can quantify its reliability by a parameter called coherence defined 

as [3] 

i^A , , 

c = ^ (10) 

where A = H-^A'^ -\- HxA^ is the estimated total signal 5-vector and H = ^^- The coherence 

is a number between and 1 which measures the resemblance between the shape of the expected 
signal and the data. In fact it does not depend on scaling factors on the signal but just 
on its shape. We have numerically found that in the absence of any signal the coherence is 
described (assuming all the source polarization parameters are unknown) by a /3 distribution 
with parameters ai = 2, 02 = 3: 

12c • fl - c)'^ 

p{c; ai = 2, as = 3) = — \- ^- (11) 

n[ai = 2, 02 = 0) 



where -6(01,02) is the j3 function which acts as a normahzation constant. Large values of 
coherence have a smah probabihty of being produced by noise alone. We will further understand 
the utility of the coherence when extending the analysis to more datasets. 

If no significant candidate is found an upper limit on the signal amplitude can be computed 
through a Monte Carlo simulation [3]. 

The 5-vectors method presents various strong points. For instance, the computation of cross 
products among data and signal templates is very fast because it involves only 10 complex 
numbers instead of o(lO^) as it would be working in the time domain and this, in conjunction 
with the re-sampling procedure used to correct the Doppler effect, is particularly useful when 
doing simulations. Moreover, the re-sampling procedure allows to easily extend the method to 
the so-called 'narrow band' searches in which a possible mismatch between the electromagnetic 
and gravitational wave signal can be accounted for. As we will see in SecEl another advantage 
is the straightforward extension to a network of detectors, which is the main topic of this paper. 



2.1. Signal-to-noise ratio 

In the absence of signals, assuming the noise has a Gaussian distribution with mean zero and 
variance o"^, each component Xj ii = 1, ..5) of the data (noise) 5- vector is distributed as a 
complex gaussian variable, with ^x = and a\ = a^T, as it can be immediately seen from 
Eq.dH) (i.e. the real and imaginary parts are two independent gaussian variables with mean zero 
and variance a'^T/2). The expectation value of |Xp is £'[|Xp] = Yl,i=i l-^iP = 5a'^T. Using 
Eq.© it is easy to see that the two amplitude estimators also have a gaussian distribution with 
mean zero and variance 
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Let us define the signal-to-noise ratio (SNR) at the filter output as the ratio between the 
estimated signal amplitude and the standard deviation of the estimation when only noise is 
present: 

5iV«=A = _^^np= (14) 

From the definition of 5- vector, see Eq.Q, we see that each component of the signal 5-vectors 
A^ and A^ increases linearly with the observation time T. Hence, from Eq. ffH 
SNR oc vT, as expected for a coherent analysis. 



we have 



3. Extension to more datasets 

Let us consider n datasets. Let us assume for simplicity that they have all the same length, 
the same noise level and belong to the same detector. Let us compute the corresponding 
5-vectors, Xj, i = l,...n. Given a source target let us compute also the signal 5-vectors, 
A^, A^ , i = 1, ...n. We can now put together the n 5-vectors of each kind in one 5n- vector: 

X = [Xi,X2, ...x„] 

A+ = 



A+ A + 
-^1 ) -^2 ' 
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(15) 



In this simple case |A^ 



\Aj I 



\/i,j = l,...n (because the datasets belong to 



the same detector and have the same length) so that 
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Then from Eq. (jl4p we have 



SNR = SNRi • Vn Vi = 1, 
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(16) 



(17) 



This means that working with 5n-vectors the signal-to-noise ratio increases as in the case of 
a coherent analysis over the total observation time. This result can be easily generalized to 
the case in which datasets of different length, sensitivity or belonging to different detectors are 
considered. For instance, let us consider two datasets with different sensitivities, i.e. with two 
different values of variance o"^, al for the components of the corresponding 5- vectors, and two 
different pairs of signal templates (A^, A^) and (Ag", Ag ) (different because corresponding to 
different detectors and/or different observation periods). In this case the variance of the signal 
amplitude estimation is given by 
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(18) 



It can be shown that also the probability distribution of the detection statistics is equal to that 
we would have in case of 5-vector analysis over the whole observation time. The estimators of 
the signal complex amplitudes are given by 
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(19) 



From these equations in case of detection we can estimate signal parameters exactly in the 
same way as in the n = 1 case. We also see that each data 5-vector 'interacts' only with the 
corresponding templates. 

The practical usefulness of working with 5n- vectors is clear when we want to combine data 
from different detectors, but it can be appreciated also when we have two or more datasets 
of the same detector. In this case the advantage of using 5n- vectors comes mainly from the 
properties of the coherence, defined by Eq. (jlOp . We have numerically found that the probability 
distribution of the coherence computed over 5n- vectors in presence of gaussian noise alone is a 
/3 distribution with parameters ai = 2, 02 = 5n — 2: 



p{c; ai = 2,02 = 5n — 2) 



(5re-2)(5ra-l)c-(l-c) 
B{ai = 2,02 = 5n-2) 



5n— 3 



(20) 



which for n = 1 reduces to Eq. lfTT 
than a given threshold cthr is 



The probability of obtaining a value of coherence larger 



P{c > Cthr) = (5n - 1)(1 - Cthr 
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(21) 



and then it is a decreasing function of the number n of datasets being considered. This is shown 
in Fig.([T]) where Eq. ([2T]) is plotted for n = 1, 2, 3. For instance, the probability of getting a value 
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Figure 1 . Probability to have a value of the coherence larger than a given threshold cthr for a 
5n-vector analysis with different values n of the number of datasets. 



of the coherence larger than cthr = 0.8 is ~ 0.03 for n = 1, and decreases to ~ 10~^ for n = 2 
and to ~ 10~^ for n = 3. The computation of the coherence after a 5n-vector analysis is then a 
powerful tool to reject false candidates produced by noise fluctuations. 

We have tested the 5n-vector method by analyzing data in which a simulated signal has 
been added to several realizations of white Gaussian noise. Tab.([T|) shows the mean value and 
variance of the distribution of estimated signal parameters for a set of 500 realizations of noise 
(with output SNR ~ 15). Columns 1 and 2 refer to the 5-vector analysis of each of the two 
datasets, which have the same length and same sensitivity in this example. Column 3 refers 
to the 5n-vector analysis over the two periods (n=2 in this case) while column 4 refers to the 
5-vector analysis over the total dataset. We see that the results of the 5n- vector method are 
identical to those obtained coherently analyzing the whole observation period. On the other 
hand, while also in the case of analysis over the single periods the mean value of estimated 
parameters is very near to the injected ones, due to the high SNR we have in this example, the 
variance of the estimations is larger. In fact, it decreases as the total observation time and so, 
as expected, for the 5n-vector analysis and for the coherent analysis over the total dataset it 
reduces by a factor of about 2. 

In Tab. ([2]) we show the results of the same kind of analysis for various values of SNR. The gain 
in jointly analyzing different datasets with the 5n- vector method is more evident for moderate or 
low signal-to-noise ratio, when the bias and variance of the estimations in the 5n- vector analysis 
are much smaller than that of the 5-vector analyses on the single datasets. For instance, for 
SNR = 5 the average relative error on the amplitude Hq is of about 11.8% in the analysis of 
the single datasets, while it is of about 3.4% for the 5n-vector analysis. Similarly, for rj we have 
a relative error of about 15% in the former case and 6.2% in the latter case. In Fig. ([2]) the 
distribution of estimated parameters for the 5-vector analysis over a single dataset and for the 
5n-vector analysis over two datasets are shown for SNR = 5. The smaller bias and variance of 
the estimations with the 5n-vector method are clearly visible. 



Table 1. Parameter estimation in the analysis of simulated data. The simulation consisted in 
the injection of a simulated signal into 500 realizations of white Gaussian noise. Injections were 
done into two datasets of length 100 days assumed to belong to the Virgo detector. The noise 
level was the same (output SNR ~ 15). The injected parameters were: Hq = 1, r^ = 0.16, ip = 
24.439°. In columns 1 and 2 the mean value and variance of the estimated parameters obtained 
with the 5-vect analysis applied separately to the two datasets are given. In column 3 the results 
for the 5n-vect analysis over the two periods are given (with n=2 in this case). Column 4 refers 
to the 5-vector analysis of the whole period covered by the two datasets. 
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Figure 2. Distribution of estimated Hq and r] for single period analysis (blue, dashed line) and 
for the 5n-vector joint analysis (green, solid line) in the case SNR = 5, see Tab.([2|). Vertical 
red lines indicate the injected parameters. Note that the estimation of parameters over a single 
period is clearly more biased and has a larger variance respect to the result of the joint analysis. 



4. Further developments 

The next step to test the method will be the analysis of real data from Virgo and LIGO detectors, 
searching for software and hardware injected simulated signals. After this validation it will be 
ready for the search of real signals. A priority will be the joint analysis of Virgo VSR2 and 
VSR4 and of LIGO S6 data targeted to the Crab pulsar, for which the sensitivities could allow 
to improve the best published upper limit |12] . 

Another relevant issue to be addressed is the possible presence of a phase discontinuity in 
the signal to be detected. This could be due, for instance, to a glitch of the emitting source, 
like those observed in several pulsars. Glitches are sudden spin-up of a neutron star followed by 
a long term relaxation period. The mechanism at the base of glitches is still poorly known but 
the most popular models invoke quakes in the neutron star crust [13J or the transfer of angular 
momentum from the superfluid to the inner crust ^3] • If this happens we can always think to 
split the data into two datasets corresponding respectively to times before and after the glitch. 
In this case, however, the direct application of the 5n-vector analysis would not work due to the 
relative phase of the signal in the datasets. If the phase jump can be accurately estimated, e.g. 
from radio-astronomical observations, then it is easy to properly shift the signal templates in 



Table 2. Parameter estimation in the analysis of simulated data. The simulation consisted 
in the injection of a simulated signal into 500 realizations of white Gaussian noise for different 
values of SNR. Injections were done into two datasets both of length 100 days: one assumed 
to belong to the Virgo detector, the other belonging to LIGO Livingston. The noise level was 
the same in the two cases. The values of injected parameters are the same as in Tab.([T]). In 
the first two tables the mean value and variance of the estimated parameters obtained with the 
5-vector analysis applied separately to the two datasets are given. In the third table the results 
for the 5n- vector analysis over the two periods are reported (with n=2 in this case). Values for 
the polarization angle are not shown to keep the table simpler. 
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order to correct it. If the phase shift is not known a simple 'brute force' approach could be to 
treat it as a free parameter and estimate it by numerically maximizing the total coherence. More 
sophisticated solutions need a detailed study. This issue will be discussed in detail elsewhere. 
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